Creating Template Contract Documents using Multi- Agent Text Understanding and Clustering in Cars Insurance Domain

نویسندگان

  • Igor Minakov
  • George Rzevski
  • Petr Skobelev
  • Simon Volman
چکیده

The paper discusses problems in automated processing and classification of unstructured text information and suggests a new approach based on the multi-agent technology. The approach was applied for one of UK insurance companies to analyze 25000 documents related to car insurance domain, leading to development of a system, capable to analyze documents, classify them into hierarchical semantic structure and build a template, which includes suitable parts of all similar documents. The paper describes the system, presents testing results and discusses perspectives.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

A Multi-Agent System for Distributed Cluster Analysis

One of the approaches used to improve the accuracy and relevancy in information retrieval is cluster analysis. Clustering methods determine relationships among text documents, and allow the determination of similar groups or clusters of documents. These methods are computationally expensive, thereby limiting their use to a relatively small set of documents. This paper describes a multi-agent sy...

متن کامل

روش جدید متن‌کاوی برای استخراج اطلاعات زمینه کاربر به‌منظور بهبود رتبه‌بندی نتایج موتور جستجو

Today, the importance of text processing and its usages is well known among researchers and students. The amount of textual, documental materials increase day by day. So we need useful ways to save them and retrieve information from these materials. For example, search engines such as Google, Yahoo, Bing and etc. need to read so many web documents and retrieve the most similar ones to the user ...

متن کامل

A Multi-intelligent Agent Architecture for Knowledge Extraction: Novel Approaches for Automatic Production Rules Extraction

In this paper, multi-intelligent agent architecture has been proposed for automatic knowledge extraction from its resources (domain experts and text documents). The extracted knowledge should be stored in a knowledge base to be used later by knowledge-based systems. This article aims to produce an effective knowledge base by cooperation between expert mining and text mining techniques. Firstly,...

متن کامل

Thematic clustering of text documents using an EM-based approach

Clustering textual contents is an important step in mining useful information on the web or other text-based resources. The common task in text clustering is to handle text in a multi-dimensional space, and to partition documents into groups, where each group contains documents that are similar to each other. However, this strategy lacks a comprehensive view for humans in general since it canno...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007